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1. INTRODUCTION 


There are several ways of approaching an optimal control problem 
by means of the Calculus of Variations. Perhaps the most well known 
technique, described in a paper by Berkovitz (X), involves adjoining 
additional variables to the system, commonly called slack variables, 
in order to transform inequality constraints into differential equation 
constraints. In the new problem all variables are unrestricted; hence, 
the classical theory can be applied. Necessary conditions for the 
Bolza problem are then translated into necessary conditions for optimal 
control. These conditions include the maximum principle of Pontryagin 
(20) which is seen to be a consequence of the Euler Lagrange equations 
and the Weierstrass necessary condition. In a paper (12) and in his 
book (13), Hestenes used a similar method to sti^dy optimal control 
problems. This method of slack variables has been widely used by a 
number of authors for some time. In feet, as early as 1937, F. A. 
Valentine (26) applied it to Lagrange problems with differential 
inequalities as added side conditions. 

Later Berkovitz (2) applied this technique to problems with 
bounded state variables. He obtains essentially the results of 
Gamkrelidze in Chapter VI of (20) for the nonlinear problem with 
inequality constraints involving only state variables. Hestenes* 
results were extended by Russak (21) and Guinn (10) to include the 
bounded state case. 

Another approach, described by Kalman (15), uses the Hamiltoniap 
theory of the Calculus of Variations as it was developed by 

i 
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Caratheodory (5) * Sagan also utilizes this method in his book (23) 
where the maximum principle is shown to follow from Caratheodory f s 
lemma in a rather simple way. However, as is pointed out by the author, 
the usefulness of this technique is quite limited due to the fact that 
unnecessary assumptions must be made regarding the differentiability of 
Hamilton’s characteristic function and the existence of an admissible 
set of inception. 

In this thesis a technique described by Park in (18) and (19) and 
applied to a simple problem by this author in (11) is to be utilized to 
study properties of solutions to various general problems in Optimal 
Control Theory, in particular those with bounded state variables. By 
means of an appropriate transformation of variables, optimal control 
problems are converted into Lagrange problems of the Calculus of 
Variations. This is accomplished by using mappings satisfying certain 
properties which take some euclidean space onto closed control and 
state regions. 

Of course, the fundamental question must be considered as to 
under what conditions is the transformed problem equivalent to the 
original one. That is, one must know that solutions to the new problem 
lead to solutions of the old one and vice versa. This equivalence 
question is discussed in great generality by Park in (18); in this paper 
it will only be considered in relation to the particular problems 
investigated. 

In chapter 2, the general problem and the method used are rigor- 
ously stated. The concept of equivalence is defined and a simple 
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equivalence criterion is stated and proven. The particular transfor- 
mation to be used herein is exhibited and its use is justified. Also 
included is a survey of the literature concerning bounded state 
prob lems. 

The non-linear control problems with the unit m-cube as control 
region and the unit n-cube as state region are considered in Chapter 
3. > Results similar to those of Berkovitz (2) are obtained as conse- 
quences of the well known necessary conditions for the problem of 
Lagrange. The translate to the optimal control setting of the 
hypothesis regarding the rank of the matrix of partials of the 
constraining equations with respect to the derivatives of the variables 
involved is obtained, and a new second order necessary condition 
analogous to the Clebsch condition is developed. In addition, the 
question of the singularity of optimal subarcs along boundaries of the 
state region is investigated, as is the behavior of solutions at 
boundary points where the optimal trajectory either enters or leaves 

the state boundary, 

( 

In Chapter 4 the results of Chapter 3 are applied to a simple 
example for which solutions are completely characterized. The problem 
is that of finding a solution to the differential equation k = u from 
some fixed starting point to the origin in minimal time under the 
restriction that |u| _< 1 and including |x| 1 and |x| <_ 1 as state 

constraints. This chapter also serves as a preview to Chapter 5 in 
which the general linear time optimal control problem is considered both 
in the usual setting where the optimal trajectory is interior to the 
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state region and in the state constrained setting. In the unconstrained 
(in the state sense) case a generalization of the "bang - bang" 
principle is obtained, and analogous results are found for the state 
constrained case. This also leads to a new necessary condition, for 
the bounded state problem. This condition is always satisfied in the 
unconstrained case. 

Finally, Chapter 6 summarizes the results obtained and offers 
some recommendations for possible extensions and generalizations. 
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2. BOUNDED STATE PROBLEMS AND THE QUESTION OF EQUIVALENCE 

In this chapter a general optimal control problem with bounded 
state variables is defined and transformed into an equivalent Lagrange 
Problem by means of suitably chosen mappings defined on the state and 
control regions* Precise conditions for this equivalence will be 
formulated and specific mappings exhibited for particular state and 
control regions satisfying these conditions. 

2.1 The Problem 

Let be a subset of R m , euclidean m-space, and T a subset of R n . 
The sets ft and T will be referred to as the control region and state 
region respectively. Let x° and x 1 be points in F ; x° will be referred 
to as the initial point and x 1 the terminal point. The results herein 
may be easily generalized to the case where one has initial and terminal 
mhnifolqs rather than single points. These, usually defined by systems 
of finite equations, are disjoint, closed subsets of T. 

We now consider a differential system whose state at time t is 
characterized by a vector x(t) = (x^ (t) , • . . ,x (t) ) in T and whose value 
is determined or controlled at that time by a vector 

u(t) = (u 1 (t) , . . . ,u (t) ) in ft, the so-called control vector. Let the 
system be defined by the differential equations x = f(t,x,u) where f is 
assumed to be a continuously differentiable vector-valued function 
defined on R x T x Q where R is the real line. This is a non- autonomous 
system of n non-linear first order differential equations in the n + m 
unknowns x and u* Let an initial time t be given and designate the 
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final time, which is variable, by tj. 

In addition, let f Q (t,x,u) be a continuously differentiable real 

valued function defined on R x r x ft. We will refer to f as the cost 

o 

function and its integral over the interval [t ,tj] as the cost 
functional,. 

The problem to be considered is the following: 

Problem I . 

Find a sectionally continuous control u(t) defined on [t Q ,t ] for some 
t 1 > t so that there exists x(t) , sectionally smooth, defined on 
[t Q ,t 1 ] such that 

(1) x(t) = f (t,x(t) ,u(t) ) for all t e [t jtj] for which x(t) is 
defined, 

(2) x(t) e T and u(t) e ft for all t e [t Q ,t ], 

(3) x(t Q ) * x° and x(tj) = x 1 , 

f fc i 

(4) f (t ,x(t) ,u(t) ) dt is minimized. 

J 1 ° 

o 

This is an example of an optimal control problem with restricted state 
variables. Such a control u(t) satisfying (1), (2), and (3) is called 
an admissible control. A control for which (1) , (2) , (3) , and (4) is 
satisfied is called an optimal control, and its corresponding x(t) is 
called an optimal trajectory. 

2.2 Review of Literature 

this type of problem has been considered by a number of people. 
The papers by Berkovitz (2), Guinn (10) and Russak (21) have already 
been discussed in Chapter 1. They both use the method of slack 
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variables to translate necessary conditions for the Bolza problem into 
necessary conditions for the above problem. 

Other approaches have been more direct and do not utilize the 
Calculus of Variations. Gamkrelidze (20) adjoined the total time deriv- 
ative of the state constraint to the cost functional and treated the 
resulting problem in the same fashion as he and Pontryagin did for the 
unconstrained problem. His "regularity" assumption, which also arises 
in this thesis but for different reasons, is made in order to insure 
that the control explicitly appears in the derivative of the constraint 
so that it directly influences his new cost functional. 

In conjunction with numerical applications, Bryson et al. (4) 
developed techniques for avoiding this assumption in some cases by 
using higher order derivatives of the constraint. Results related to 
this are also demonstrated in a paper by Speyer and Bryson (24). 

Dreyfus in (8) and (9) uses the method of dynamic programming to 
obtain results similar to those of Berkovitz. In fact, Berkovitz and 
Dreyfus compared their previous results in a joint paper (3) in 1965. 

The penalty function approach is entirely different from all of the 
above. This method, first described by Chang in (6) and (7), involves 
the following. Instead of attempting a direct solution, an uncon- 
strained problem is considered wherein the original cost functional is 
augmented by a non-negative penalty function which sharply increases 
the cost associated with trajectories which violate the state 
constraints. By using sequences of cost functionals involving more and 
more severe penalties it is to be expected in many cases that the 
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desired solution to the original problem will be obtained as the limit 
of the solution of the approximate problem. This technique has been 
subsequently refined and further results obtained by Russell (22) and 
Jacobson et al. (14) , 

In (16) Khrustalev discusses the very difficult question of suffi- 
ciency for constrained state problems, and finally Mclntire and 
Paiewonsky, in an expository paper written in 1964, survey the 
techniques known at that time. 

2.3 Transformation into a Lagrange Problem 
We shall now state the method to be used herein to treat problems 
of the type described in 2.1 as Lagrange problems in the Calculus of 
Variations. 

k. 

Consider the functions ip and <j> defined so that R -> ft and 

£ 

<t>: R + T where k and £ are natural numbers. Assume that both ip and <p 
are onto and continuously differentiable. Then if we let x = cf>(y) and 
u « i|,(z) and restate our original problem in terms of the new variables 
y = (yp...^) and z = (z^... ,z fc ) we obtain the following: 

Problem II . 

Find a sectionally continuous z(t) defined on [t ,t. ] for some t, > t 

oi i o 

so that there exists y(t), sectionally smooth, defined on [ t Q , t ^ ] such 
that 

(1) ' |^- (y(t)) y (t) = f (t,<Ky(t)) , <|/(z(t))) for all t e [t Q ,t 1 ] 

for which y(t) is defined, 

(2) ' y(t) e and z(t) e R^ for all t e [t ,t ], 

(3) * ^(y(t Q )) = x° and (y ( t x ) ) = x 1 . 
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(4)' 


1 f ( t , 4> (y ( t) ) , *Kz (t) ) ) dt is minimized. 


In (1) f is used to denote the n x i matrix of partial deriv- 

atives of the components of <P with respect to the components of the 
vector y. Notice also that this new problem is unconstrained in the 
sense that the new variables y and z are allowed to take on any values 
itt ^ and respectively. 

The variable z which takes the place of the control u is intro- 
duced as a derivative so that the new problem will satisfy the 
hypotheses for the necessary conditions to the Lagrange problem which 
require that solutions, in this case y(t) and z(t), be sectionally 
smooth. That is, a sectionally smooth solution of problem II, y(t) 
andz(t), will lead to a sectionally continuous control u(t) = iKz.(t)) 
and a sectionally smooth trajectory x(t) = <Ky(t)) as its counterpart 
in problem I. 

We may now view problem II as a Lagrange problem with unknowns y 
and z, constraining differential equations given in (1) ? , boundary 
conditions (3) 1 and with (4) 1 giving the functional to be minimized. 
Hence all the well known classical theory associated with this problem 
may be applied in order to find solutions. A full discussion of this 
theory is contained in Chapter 6 of (23) . 

However, before we proceed farther along these lines, we must 
establish under what conditions is problem II actually equivalent to 
problem I, in the sense that a solution to problem II leads to a 


solution of problem I and vice versa. That is, we must be able to 
translate necessary and sufficient conditions for y(t) and z(t) in 
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problem II into necessary and sufficient conditions for u(t) = ^ ( z ( t) ) 
and x(t) = (y ( t) ) as optimal controls and optimal trajectories in 
problem I* The next two theorems provide an answer to this question* 
Definition 2.1 . The mappings ip and < p shall be called an acceptable 

pair of transformations provided the following are true: 

1c £ 

(1) ip: R ft, <J>: R T for some k and & and both are onto and 
continuously differentiable; 

(2) for any sectionally smooth x(t) defined on some interval 
[t Q ,t ] such that x(t) e F for all t, there exists a 

£ 

sectionally smooth y(t) defined on such that y(t) c E 

and cj>(y (t) ) = x(t) for all t e [ t Q , t ] ; 

(3) for any sectionally continuous u(t) defined on soine interval 

[t ,t ] such that u(t) e Q for all t, there exists a 
L o 9 i 

sectionally continuous z(t) defined on [t Q ,t ] such that 


xp (S ( t) ) - u(t) for all t £ [t Q# t ]• 

Theorem 2*1 * If ip and <J> are an acceptable pair of trails formations and 


z(t) , y(t) defined on [t Q ,t ] yield a solution to problem II then 
u(t) = ip(z(t)) and x(t) = 4>(y(t)) yield a solution to problem I. 

Proof: We shall verify that x(t) and u(t) satisfy (1) , (2) s (3) and 

(4) of problem I* Since x(t) = (y (t) )y (t) = f (t ,<Ky(t)) ,^(z(t))) ® 

f (t,x(t) ,u(t)) we see that (1) is satisfied. Clearly (2) follows from 
(1) of definition 2*1, and x(t Q ) = 4>(y (t Q )) = x°, x(t^) = <Ky(t x )) = x 1 , 
so (3) is satisfied. Now suppose (4) were not true, then there would 
exist x(t), u(t) and t^ satisfying (1), (2) and (3) such that 


f q ( t , x ( t ) ,u(t) ) dt < 


ft, 


f Q ( t , x ( t ) ,u(t) )dt. 
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Now applying (2) and (3) of definition 2.1 there exists y(t) and z(t) 


such that x(t) = <Ky(t)) and u(t) = *Kz(t)) for all t c [t Q ,t ]. 

0 0 0 
Then we have 4> (y ( t) ) y(t) = x(t) = f (t ,x(t) ,u(t) ) « f (t ><Ky(t)) ,^(z(t))) 


o — , 


for all t and moreover x = x(t Q ) = <Ky(t o )), x 1 = = ♦ (y(t 1 )). 

However , 


ft. 


l f 0 (t,Hy(t)),^(z(t)))dt = 


f 0 ( t j X (t) ,u(t))dt < 


'1 


f(t,x(t) ,u(t))dt = 


'1 


f 0 (t , 4> (y ( t) ) ,^(z(t)))dt 


o o 

which contradicts the fact that y(t) and z(t) yield a solution for 
problem II. Thus (4) must be true and hence x(t) and u(t) yield a 
solution to problem I. 

The next theorem is the converse of theorem 2.1. 

Theorem 2.2 . If i|> and <p are an acceptable pair of transformations and 
u(t) , x(t) defined on [t Q ,t 1 ] yield a solution to problem I then any 
sectionally smooth y(t) and z(t) such that x(t) = <K y(t)) and 
u(t) = <p(z(t)) for t e [t Q ,t 1 ] yield a solution to problem II. 

Proof: We know that at least one such y(t) and z(t) exist by (2) and 

(3) of definition 2.1. We must therefore show that such a y(t) and 
z(t) satisfy (1)’, (2) * , (3)' and (4)' of problem II. Notice that 
<t> y (y(t))y(t) = x(t) = f (t,x(t) ,u(t)) = f(t,<J>(y(t)) ,^(z(t))); thus we 
have that (1)’ is true. Moreover (2)' is trivially satisfied and since 
4> (y ( t Q ) ) = x(t Q ) = x° and 4>(y (t j) ) = x(t x > = x 1 , (3)' is also. Now if 
y(t) and z(t) defined on [t Q ,t ] satisfies (1)', (2)', (3)' and 


ft, 


f 0 (t,<Ky(t)) ,Kz(t)))dt < 


f 0 (t,*(y(t)),*(i(t)))dt 
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then it is easily seen that x(t) = <Ky(t)) and u(t) = ip(z(t)) will 
contradict the optimality of x(t) and u(t) • Thus y(t) and z(t) must 
yield a solution to problem II. 

The following corollary follows immediately from theorems 2.1 
and 2.2. 

Corollary 2.1 . If ip and $ are an acceptable pair of transformations 
then any necessary or sufficient condition for y(t) and z(t) to be a 
solution of problem II yields a necessary or sufficient condition 
for x(t) and u(t) to be a solution of problem I when the condition 
is restated in terms of x(t) = <Ky(t)) and u(t) = iKz(t)). 

A much more general discussion of the equivalence of minimization 
problems is contained in (18), particularly in regard to unconstrained 
problems . 


2.4 The Control and State Regions as Right Parallelepipeds 
Consider problem I with 

ft = (u e R m : a. < u. < b., i = l,...,m} and 
i — l — l 

T “ (x e R n : c, < x, < d. , i = l,.,.,n} 
i — l ~ x 

where a.,b.,c. and d. are real numbers such that a. < b. and c. < d, 

ill i l i i i 

for each i. Then ft and T defined in this way constitute right 
parallelepipeds. Now define \p: R m ft by ip = where 

u^ = ^( 2 ) = j ((b - a_^) sin z^ + (b_^ + 

for i = l,...,m. Also let < p : R n + F be given by (p = (<(>-...,(() ) where 

1 n 



13 


x ± = ^(y) = 2 ^ d i “ C i^ S±n y i + ^ d i + c i^ 
for i = 1, . . . ,n. 

Theorem 2,3 . For this choice of SJ, T, i(i and <f>, the mappings ^ and $ 
constitute an acceptable pair of transformations. 


Proof: Since -1 < sin z. < 1 for all real numbers z . , we have 

— i — ■ i 

(a - b ) (b - a ) # (b - a ) 

s in ^ > an d therefore 


(a. - b ) (b + a, ) (b -a.) (b + a , ) 

a i ' 2 ' + " 2 - i ' 2 6in 4 i + 


(b - a > (b + a > 

i j + 2 b i 


for i = 1, . . . ,n. 

Thus we see that <p(z) e ft for all i t R m , Similarly we can show that 
4>(y) e r for all y e R n . Moreover <p and <f> are clearly continuously 
differentiable. Now let u(t) be a sectionally continuous function 
defined on [t o ,tj] such that u(t) e for all t e [t o ,t ]. Define 
H t) by 

^(t) “ sin 

We first notice that each z^Ct) is well defined for all t e [t Q ,tj]. 
This is true since u(t) e £2 means that a^ ^ u^t) <_ b^ for i » l,...,m 
and hence it follows that a^ - b^ = 2a^ - (b^ + a^) 2u^(t) - 

< b l + a ± ) < 2b t - (b t + a ± ) - b f - a ± . 


2u i (t) - (b^ + a i ) 

(bprsp 
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Dividing this inequality by we obtain 

2u (t) - (b + a ) 

-1 £ ■ - ■ £ 1 . 


<b i ' a i ) 


. "I 


Hence the sin of the middle term in the above inequality is well 
defined. Note that this argument yields that ^ is onto since u(t) 
may in general be any point of ft and clearly if z is defined in the 
above fashion for any u e ft we have ^(z) * u. Moreover , "since sin ^ 
is a sectionally continuous function, we see that the z(t) which we 
have defined will be sectionally continuous also. Therefore ^ 
satisfies condition (3) of definition 2.1. If x(t) is a sectionally 
smooth function defined on [t Q ,t 1 ] such that x(t) e T for all 
t e [t Q ,t ], we can satisfy condition (2) of definition 2.1 by 
defining 


/x . -1 

y i (t) ■ sin 


2x i (t) - ( d ± + c i ) 


+ 2tt 


for i = and t e [ t o , t ^ ] where j(i,t) is a integer chosen for 

each i and t to insure that y^Ct) is not just sectionally continuous 
but also sectionally smooth. Since sin (2 tt j ) = 0 for all integers j, 
this term does not affect the relationship <Ky(t)) = x(t) . Thus the 
theorem is proved and we have shown that ip and <p are an acceptable 
pair of transformations. 

We now have that for this particular choice of ft, T, \p and <j> that 
theorems 2.1 and 2,2 apply, and hence, in this case, problem I is 
equivalent to problem II. 


i 
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In the case that some of the components of u or x are to be 

unrestricted, while the remainder are constrained between maximum 

and minimum values as previously, one simply defines, t|^(z) = u^ or 

cf>i (y ) = x for those particular ones and defines the rest of the 

components of ip and <J> as is done previously with the sine function. 

Clearly, in this case, the results of theorem 2.3 also follow. For 

numerical applications the previous formulation is more desirable 

as in effect it covers both cases. When a component of u or x is to 

be unrestricted, one simply inputs to the system maximum and minimum 

values of the variable which are exceedingly large and exceedingly 

small respectively, thus effectively eliminating the constraint. 

For simplicity sake, in the remainder of this text we shall let 

Q be the unit m-cube and F be the unit n-cube. That is, we will set 

a, = -1, b, = 1 for i = l,...,m and c. = -1, d. = 1 for i = l,...,n. 
l 9 i i l 99 

Then ip and 6 reduce to'i}(z)= u = sin z = (sin z_,,..,siri z ) and 

i ’ m 

<Ky) = x = sin y « (sin y 1 ,...,sin y ) . 

Examples of acceptable choices of ip and < p corresponding to more 
general control and state regions are contained in (18). 
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3. NECESSARY CONDITIONS FOR THE NONLINEAR BOUNDED STATE PROBLEM 

Let us first introduce for notational convenience an operation 

on pairs of vectors which we shall designate by If a = 

and b = (b.)? - are n-vectors, then define a#b e R n to be the n-vector 
x 1=1 

(a.b,)? -o Clearly is commutative, associative and distributive 
i i i=l 

with respect to addition. That is, if a,b,c e R n then a*(b 4- c) = 
a*b + a*c. We shall use a*b to designate the ordinary dot product of 
the vectors a and b. 

In addition, for a e R n we introduce the notation "diag (a)" and 
define diag (a) to be the n x n diagonal matrix whose diagonal elements 
are precisely the components of a. Notice that for a,b e R n we have 
diag (a)b = a*b . 

Now consider problem I in section 2.1 with 


Q = (u £ 

R m : 

lu.l H, 

l 1 , . . . ,m) , 

the 

unit 

m-cube, and 

r = {x e 

R n : 

! x ± l i i, 

I 1, • , • ,n} , 

' the 

unit 

n-cube. 


Then if we define cj) and if j by 

c p(y ) = x = sin y and 

ip(z) = u = sin z 9 

then theorems 2,1, 2.2, and 2.3 insure that for this case problem I is 
equivalent to problem II in section 2.3. Note that we now have that 
= diag (cos y) so that y = cos y*y. Hence the constraining 
differential equations for problem II become 
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cos y*y = f(t,sin y,sin z) . 


The boundary conditions are 

sin (y(t Q )) = x°, and sin (yCt^) = x 1 . 


and the cost functional is 


1 

f Q (t,sin y(t),sin z(t))dt. 

^ t 

o 

Define the functions ^ and taking values in R U by 


$ 0 (t,y,z) .= f 0 (t 1 ,sin y,sin z) , 
®(t,y 9 y,z) = cos y*y - f(t,sin y,sin z) , 
^ (y) = sin y - x° and 
(y) = sin y - x 1 . 


Then, if we restate problem II of section 2.3 for the particular 
problem considered in this chapter using the above functions, we 
obtain: 

Problem II T 

Find piecewise smooth vector functions y(t) and z(t) defined on 

[t ,t n ] where t is fixed and t, is variable which satisfy the 
o 1 o 1 J 

constraining differential equations $(t,y,y,z) = 0, the initial 
conditions W Q (y(t o )) = the terminal conditions V (y(t^)) = 0 and 


is such that 


(t ,y(t) ,z(t))dt is minimized. 


The following theorem summarizes the results found in chapter 6 
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of Sagan’s book (23) concerning properties of solutions to the above 
problem. 

Theorem 3.1 . If y(t) and z(t) defined on [ t 0 > t 1 l constitute a solution 
to problem II v and if the following conditions are satisfied 

(1) and are continuously differentiable, 

(2) along ;y(t) and z(t) we have rank( y^ - \y ) = n for all 

3<f> 

t e [t -t.] where -r-r-. — rr is the n x (n + m) matrix of 
o l 9(y»z; 

partials of the components of $ with respect to the compo- 
nents of y and z; 

9 ^ 8 ¥ 

(3) rank (■£— (y(t Q ))) = n = rank (-g— (yCtj))) , 

then there exists a vector function A(t) = (A,(t),...,A (t)) defined 

1 n 

on [ t Q » t ^ ] which is continuous except possibly at the points where 
y(t) and/or z(t) are not smooth and a constant A q £ 0 with 
(X of Xi(t) f ...,A ( t) ) 4 (0,0,. . . ,0) for all t e [t Q ,t ] such that the 
following statements are true: 

(1) Euler Lagrange Equations 

define h(t,y,y,z,A) = -A $ (t,y',z) + A • $ (t ,y ,y , z) , then the 

o o 

differential equations 

h (t,y,y,z,A) - lu(t,y,y,z,A) = 0 and 

4r h. (t,y,y,z,A) » 0 
dt z 

are satisfied along every smooth arc of y(t) and z(t) where 
hy 5 h^, and hu denote the vectors of partials of the function 
h with respect to the vectors y,y and z respectively; 
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(2) Corner Conditions 

across every corner of y(t) and z(t) we must have that each 
component of 

tu (t,y(t) »y(t) ,z(t) »A(t )) , 

h. (t,y(t) ,y(t) ,z(t) ,A(t)) , and 
z 

h(t s y(t) ,y(t) ,z(t) ,A(t)) - y(t) (t,y(t) ,y(t) ,z(t) ,A(t)) - 
- z(t) *h. (t,y(t) ,y(t) ,z(t) ,A(t)) 
are continuous ; 

(3) Transversality Conditions 

there exists a constant vector v = (v 1JO o 0 ,v ) such that 

1 n 

U o ,v 1 ,...,v n ) =f= (0,...,0) and 

0 = - H - h. *y - h. * z 

oo y ^ z 

= ho and 
oy y 

0 = h e 
z 

where the above relations are evaluated at t Q> y (t Q ) ,y (t ) ,z(t Q ) 

and X(t Q ) , and there exists a constant vector jj ® (m ,*..,1^) 

such that (A (0, ee8 ,0) and 

0 = - A $ - h. »y - h. • z 

oo y z 

uYo = h e and 

iy y 

0 = h. 
z 

evaluated at t 9 y (t 2 ) ,y(t ) ,z(t x ) and A(t ); 

(4) Clebsch Condition 

for each t e [t .t,] and for all vectors a e R n s p e R m 
which are solutions to the linear systems 


^(t ,y(t) ,y(t) ,z(t))a = 0 


and 
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$^(t »y(t) ,y(t) ,z(t))p = 0 
we must have that 

a*h..a + p*h..a + a-h..p + p*h..p > 0 
yy zy yz zz - 

evaluated at t,y( t) ,y (t) ,z(t) and A(t) provided the 

functions f and f are twice continuously differentiable 

(see Hestenes (13)); 

(5) Weiers trass Condition 

• • 

for all (t,y(t),y,z) satisfying the constraining equations, 
t e [ t , t 1 ] , we have 

E(t,y(t) ,y(t) ,z(t) ,y,z,A(t)) ^ 0 

where 

E(t,y,y ,z,y,z,A) = h(t,y,y,z,A) 

- h(t,y,y,z,A) + (f - j) *h^(t ,y ,y ,z,A) 

+ (z - h (t,y ,y,z,A) # 

Using these necessary conditions we shall now determine what 
they mean in terms of our particular problem. Notice that since 
f and f were assumed to be continuously differentiable we have that 
and ^are by definition continuously differentiable ^ Hence 
assumption (1) of theorem 3.1 is satisfied for our particular 
problem. 

Now let x(t) and u(t) defined on [t Q , t ^ ] yield a solution to 
problem I of section 2,. 1, with our particular control region ft and 
state region F # Then by theorem 2.3 there exists y(t) and z(t) such 



21 


that sin (y(t)) = x(t) and sin (z(t)) = u(t) for all t e t t Q , t 3 , and 
moreover, theorem 2.2 tells us that y(t) and z(t) yield a solution to 
problem II s . We assume that hypotheses (2) and (3) of theorem 3.1 
are satisfied. Therefore theorem 3 0 1 may be applied to y(t) and z(t) 
thereby establishing corresponding conditions on u(t) and x(t) . How- 
ever, before proceeding further, some results should be exhibited 
describing under which circumstances assumptions (2) and (3) of theorem 
3.1 will be met. The next four theorems will accomplish this. 

Let M(t) be the n x m matrix f^ diag (cos z) evaluated at 
(t,y(t) ,y(t) ,z(t)) . Suppose at time t there are precisely £ state 
constraints in effect; that is, £ components of the vector x(t) have 
absolute value one. Define M(t) to be the H x m submatrix of M(t) 
consisting only of the rows of M(t) corresponding to components of 
x(t) with absolute value one. 

Theorem 3.2 0 For all t e [t ,tj], M(t) has rank £. 

Proof l This theorem follows from assumption (2) of theorem 3.1. 

Since $ = cos y*y - f 3 we see that 

¥(|~ ' z ~ ) ' = ^ diag ( cos yV)), M(t)). 

If |x^(t) | < 1 then since x^(t) « sin (y^(t)) we have that cos y^(t) 4 0 

and thus the rows of the above matrix corresponding to the n - £ 

components of x(t) with absolute value less than one are linearly 

independent. However, for the £ coordinates where |x^(t)| = 1 we have 

that cos y.(t) = 0 and therefore — rr* will have maximum rank if and 
i 9(y,z; 

only if the rows of M(t) corresponding to these components are 
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linearly independent 0 In other words this means that M(t) must have 
rank £ . 

We can see from the above proof that this theorem gives a 
necessary and sufficient condition for assumption (2) of theorem 3.1 
to be satisfied. Moreover* it can be easily seen that all trajectories 
which never meet the state constraint boundary automatically satisfy 
hypothesis (2) . 

Theorem 3.3. If x(t) is in the interior of T for all t £ [t 3 t, ], 
o l 

then hypothesis (2) of theorem 3.1 is satisfied. 

Proof g In this case |x (t)| < 1 for all i = l,...,n and t £ [t Q ,t 1 ] 
and so cos (y^ ( t ) ) =f 0 for all i = l,...,n and t e [ t , t ^ • Therefore 
rank (diag (cos y(t))) = n and hence* 

3 $ 

rank -gjj jj = rank (diag (cos y(t)),M(t)) « n. 

The next theorem tells us something about when a system is over- 
constrained. It simply says that at any time/ there cannot be more 
constraints in effect* both state and control* than there are control 
variables. Suppose that at time t there are k components of u(t) such 
that |u^(t)| - 1; that is* there are k control constraints in effect. 
Theorem 3 0 4 . k 1 + £ <_ m at every time t e [t Q ,t ]. 

Proof : According to theorem 3.2* the £ x m matrix M(t-) must have rank 
£. Now M(t) = f diag (cos z(t)) and consequently each element of 
column i of M(t) is multiplied by cos z^(t) for i = l,...,m. More- 
over* each element of column i of M(t) is multiplied by cos z.(t) also. 
For each component of u(t) with absolute value one we have that 



cos z^(t) = 0 since uu(t) = sin (z^(t)), Hence at least k columns 
of M(t) have all zero elements . Thus the rank of M(t) is at most 
m - k s and so 

rank (M(t)) = & <_ m - k or equivalently k 4* Z <_ m, 

The next theorem gives a necessary and sufficient condition for 
hypothesis (3) of theorem 3*1 to be satisfied* 

Theorem 3*5 * The vectors x(t Q ) and x(t^) are interior to T . 

Proof: Recall that ¥ = sin (y) - x° and ^ = sin (y) - x 1 . Thus we 


• ay 3Y, 

O I 

have t- — = diag (cos y) = and this matrix will have rank n if 

3 y 3y 

and only if cos y_. I 0 for i = l, ao .,n ? or equivalently, |x | <1 

for i = l,...,n. But this is the same as requiring that x be interior 

to T. Therefore x(t) will satisfy hypothesis ( 3 ) of theorem 3*1 if 

and only if both x(t Q ) and x(t^) are interior to T. 

We shall now proceed to determine the consequences of conclusions 
( 1 ) through (5) of theorem 3 0 1 for x(t) and u(t) . 

Let the function H 3 called the Hamiltonian 9 be defined by 


H(t ,x,u, A) = A f (t,x,u) + A*f(t,x,u). 

Theorem 3*6, There exists a vector function A(t) = (A 1 (t),.,..A (t)) 

i n 

defined on [ t , t ^ ] which is continuous except possibly at the points 

where u(t) is not continuous and a constant A <0 with 

o — 

(A (t)) =f 0 for all t e [t »tj] such that for i = l,...,n 


either 
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and for j = 1 m either 

|u.(t) | = 1 or (t,x(t) ,u(t) ,A(t)) = 0 

J U j 

on every smooth arc of x(t) . 

Proof: First notice that 

h = A • (y*cos y) - H(t,sin y,sin z,A) 

and thus h. = A*cos y, so 

XT * * 


— tu « A*cos y - A*sin y*y, and 


hy = - A *y *s in y - H^(t,sin y,sin z,A)*cos y, 


Hence conclusion (1) of theorem 3.1 becomes 


hy ” = - A*y*sin y - H^(t,sin y,sin z, A)*cos y 


- A*cos y + A*sin y*y = 0 


That is , 


(A + H (t,sin y,sin z,A))*cos y - 0 
x 

from which the first part of the theorem follows, 
part we have 


For the second 


h. 55 H (t. sin y, sin z,A)*cos z 
z u 

and so from the Euler Lagrange equations H^*cos z must be constant 
on each smooth subarc of x(t) . In addition, the corner conditions 
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tell us that this quantity is continuous, and hence H^*cos z must 
be constant for all t e [t Q ,t ]. Moreover, looking at the trans- 
versal! ty conditions we see that h. = H *cos z = 0 for t = t and 

z u o 

t = tj. Thus, along the solution 

H^(t,sin y,sin z,A)*cos z = 0 
from which the second part of the theorem follows. 

Corollary 3,1 , If x(t) is in the interior of T for all t e [t Q ,t 1 ], 
then 

A (t) + H x (t,x(t) ,u(t) ,A(t)) » 0 
on every smooth subarc of x(t). 

Corollary 3,2 , If u(t) is in the interior of ft for all t e [t Q ,t 1 ], 
then 

H^(t,x(t) ,u(t),A(t)) = 0 
on every smooth subarc of x(t) . 

Definition 3.1 , Let i be an integer such that 1 i n; then the 
point x(t*) for t* e ( t q , 1 1 ) is called an i-boundary corner of x(t) 
provided either x(t) is not differentiable at t = t* or t* is a point 
of discontinuity of u(t) , and |x^(t*)| = 1. 

Theorem 3,7 . For each i = 1 , , . , ,n , A (t) is continuous except possibly 

at i-boundary corners of x(t) . The function H(t ,x(t) ,u(t) , A (t) ) is 

continuous for all t e [t , t,l. 

o 1 , 
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Proof: From the corner conditions, since h. = A*cos y we must have 

y 

that the product A^(t) cos y^(t) is continuous across each corner 
of (y(t),z(t)). Let t* be a corner of (y(t),z(t)). Then either 
x(t) is not differentiable at t = t* or t* is a point of discon- 
tinuity of u(t). Suppose t* is not an i-boundary corner. Then 
|x^(t)| < 1 and consequently cos (y^Ct*)) =(= 0. Thus since cos (y (t)) 
is a continuous function which is non-zero for t = t* and 
A_^(t) cos y^(t) is continuous at t = t*, we must have that A_^(t) is 
continuous at t = t*. Since we already know that A_^(t) is continuous 
at all points where (y(t),z(t)) is smooth, the first part of the 
theorem follows. In addition the corner conditions yield that 
h - y-h. - z*h. must be continuous across corners. But it was seen 

y z 

in the proof of theorem 3.6 that along the solution h^ = 0 and 
H = A • (y*cos y) - h » y*h^ - h. Therefore H(t ,x(t) ,u(t) , A (t) ) is 
continuous across all comers of x(t) , and hence it is continuous 
for all t e [t , t, ] . 

Corollary 3.3 . If x(t) is in the interior of T for all t e [t Q ,t 1 ], 
then A(t) is continuous on the interval [t ,t^]. 

Theorem 3.8 , The Hamiltonian function evaluated at the initial and 

terminal points of a solution is zero. That is > 

H ( t , x ( t) ,u(t),A(t) ) = 0 

for t ”• t and t = t, . 

0 1 

Proof : We shall apply the t ransversality conditions of theorem 3.1. 

Notice that $ = f , and recall from the proof of theorem 3.6 that “ 

o o 

h. - 0 along a solution. We also have that h. = A*cos y, and so 

z y 



tu*y « (A*cos y) .y-= A*(y*cos y) = A* f -along a solution. .Thus 

the transversal! ty conditions yield that A Q f Q + A*f = H = 0 for 

t = t and t = t . 
o 1 

The next ' theorem follows from theorem 3.6. 

Theorem 3.9 . If the function u(t) is differentiable except at its 
points of discontinuity then H(t ,x(t) ,u(t) , A ( t ) ) is differentiable 
except at these points and 

■^7 H(t,x(t) ,u(t) ,A(t)) = H t (t,x(t) ,u(t) ,A(t)) . 

Proof ; Since = f which is equal to x along our solution, 

4~ H = H + H .x + H -u + H, *A 
dt t x u A 

= H + (A + H ) -x + H -u 
t x u 

From theorem 3.6, on every smooth arc either | x^ | = 1, in which case 

x, = 0, or 1, + H =0 for i = l,...,n, and also either lu.l * 1, 
x ix. j 

i J 

in which case u. = 0, or H =0 for j = l,,..,m. Therefore, using 

3 u j 

these facts, we see that the last two terms on the right hand side of 

the above equation are zero and hence, along each smooth arc of our 

solution H = H . 

dt t 

Corollary 3.4 . In the case of an autonomous problem, that is, if the 
functions f and f are independent of t, then 


H(t,x(t) ,u(t) , A(t) ) = 0 
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Proof : This follows from theorems 3. 7, 3, 8,, and 3.9. 

Theorem 3.10 . Let t £ [ t * t j ] and y £ R m . Decompose y into two 

subvectors y consisting of those components of y corresponding to 

coordinates of u(t) for which |u_.(t)| = 1, and Y consisting of those 

coordinates of u(t) for which |u.(t)| < 1. Decompose the vector u 
i 3 

similarly. Then for all Y and for all Y satisfying the equations 

f-Y = 0, 
u 

T*( H -*U*T) - Y-H_y > 0 at time t. 
u uu “ 

Proof : We shall apply the Clebsch condition of theorem 3.1. The 

matrices h..,h., and h. . are all 0, Thus the Clebsch condition 
yy zy yz 

reduces to: p«h..p > 0 for all p £ R m such that $.P = 0. Now by 
zz ~ z 

differentiation and appropriate substitution we obtain 

= - f diag(cos z) and h.. = diag(H *u) - diag(cos z)H diag(cos z).. , 
z u z z u uu 

Therefore , 

p*h..p = p*diag(H *u)p - p*(diag(cos z)H diag(cos z)p) 
z z u uu 

= p • (H *u*p) - (p*cos z)*H (p*cos z) , and 
u uu 

$.p = - f (p*cos z) . 
z u 

Let y e R m such that y satisfies f^y = 0. Set p~ = T and let p have 

Y k 

components p, = . This is possible since cos z. =)= 0 for 

K COS Z. K. 

k 

each k such that u^_ e u. Then clearly y - p*cos z. For each k such 
that u^ £ u we have |u^| <1, and consequently by theorem 3.6, 

( t , x ( t ) , u ( t ) , A ( t ) ) = 0. Thus, p • (H^*u*p) = Y*(H^*u*Y) “ Y* (H_*u*y) 
because the other terms involving components of p are all zero. 
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Also, since Y = P*cos z, f~(p*cos z) - 0 and since cos L = 0 for 

U K. 

all k such that z, e "z, we obtain $.P = - f (p*cos z) = 
k z u 

n 

= - f~.(p*cos z) =0. Moreover, for the same reasons 
u 

(p*cos z)*H (p*cos z) = Therefore, by the Clebsch condition 

uu uu 

we obtain 

P*h..P = P*(H *u*p) - (p*cos z) «H (p*cos z) 
zz u uu 

- T- (H_*U*T) - Y*H~~Y > 0 

u uu ~ 


and the theorem is proved. 

The next corollary is perhaps a more usable form of theorem 3.10 

Corollary 3.5 . Let t e [t ,t x ] and TT,u ,T and Y be defined as in 

theorem 3.10. Then each component of the vector H^*TT is non-negative 

and for all Y such that f^Y = 0, Y*H^Y < 0. 

u uu — 

Proof: Successively apply theorem 3.10 with Y = 0 and each component 

of T equal to zero except for the k-th component which is set equal 
to one. Then Y'H^Y = 0 and the result of theorem 3.10 reduces to 


H u> 0. 
~ 


uu 

Doing this for all k such that 




£ -u we obtain that each 


component of H_*U is non-negative. Now apply theorem 3.10 with 
T = 0 and Y any vector such that f~Y * 0. Then theorem 3.10 implies 


that yH~~Y < 0. 

uu ~ 

Notice that since corollary 3.5 clearly implies theorem 3.10, 
the condition in corollary 3.5 is actually equivalent to the condition 
in theorem 3.10. 

Theorem 3.11 . Let t e [t Q ,t 1 ]. Then for all v e R m such that |v.| <_ 1 
for i = 1, . . ,m and f(t,x(t),v) = 0 where f consists of all coordinates 
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of f corresponding to components of x(t) for which |x^(t)| = 1 , the 
following is satisfied: 

H( t ,x(t) ,u(t) , A (t) ) >_ H(t ,x(t) ,v,A(t)) . 


Proof : The Weierstrass condition in theorem 3.1 shall be applied. 


m 


Suppose v e R such that | | £ 1 for i = l,...,m and f(t,x(t),v) « 0. 

. -1 . f k (t,x(t),v) 

Let z = sin (v) and y. = 7 rrrr for all k such that 

k cos (y k (t)) 

|x^(t)| < 1 with y^ chosen arbitrarily for all k such that |x^(t)| = 1 . 

Then y*cos y(t) = f(t,x(t),v) = f(t,sin y(t) ,sin z) since 

cos y (t) *» 0 for all k such that |x^(t)| « 1 , and hence (t,y(t),y,z) 

satisfies the constraining equations. Now as was seen in the proof 

of theorem 3.7, along the solution x(t) ,u(t) we have tu = 0 and 

h - y*tu = - H. Moreover, using the fact that tu = A*cos y, 


h(t,y,y,z,A) - y (t ,y ,y ,z, A) 

= h(t,y,y,z,A) - (y*cos y) *A 
= - H(t ,x, v, A) . 


Therefore , 

. • 

E (t»yjy>z,y ,z,A) = H( t ,x ,u, A) - H(t,x,v,A) and so the Weierstrass 
condition yields that 

H( t ,x,u. A) _> H(t ,x,v, A) . 

This theorem is a generalization for the bounded state problem 
of the well known maximum principle of Pontryagin. 



Definition 3.2. (y(t),z(t)) is called a singular extremal provided 


y - y(t) 

2 = Z(t) 

is a singular matrix on some open interval contained in [t »tj]. 
Theorem 3.12 . If x(t) is on the boundary of r for some finite time 
in tt jtj], then (y(t),z(t)) is a singular extremal. 

Proof ; Differentiating we obtain 


a(h^,h.,$) 

3(y,z,A) 


and 



= $ = 0 , h. . 

x yz 


= diag(cos y) 


.. = 0 

zy 

* • 

y 


h., = - f diag(cos z) = $.. 
zA u ° z 


Therefore , 

3(h f> h 4> *) 

w 


I 0 

0 

^diag(cos y) 


diag(cos y) 


h.. 

ZZ 

- diag(cos z) 


- diag(cos z) 


which is clearly singular if cos (y^.(t)) = 0 on some open interval 
in [t that is, provided x(t) is on the boundary of T for 

some finite time. Hence (y(t),z(t)) is a singular extremal. 

This theorem shows that a solution to problem XI f is singular 
provided its corresponding solution to problem I contains a state 
boundary subarc. The definition of singularity given above is 



equivalent to the usual definition found in the literature. The 
relationships between various concepts of a singular extremal is 
investigated by Straeter in (25). 
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4. EXAMPLE OF A LINEAR PROBLEM WITH BOUNDED STATE VARIABLES 

The problem to be considered, a linear time-optimal control 
problem with bounded state variables, is the following. 

Consider the differential equation x = u where the control u 
is restricted by the condition |u| 1, and the state variables x 

and x are restricted by |x| 1 and |x| ^ 1. To be found is a 

sectionally continuous function u(t) which yields a solution to the 
above differential equation, subject to the given constraints, such 
that one arrives at the origin from a given initial state in the 
shortest possible time. That is, u(t) is defined on some interval 
[Ojtj] such that x(0) = x^, x(0) = x^ 0 and x(tj) = 0, x(t^) = 0 
and t 1 is minimal. 

If we let x 1 « x and x 2 = x, then clearly this is a special 
case of problem I of section 2.1 with t = 0, n~2, m=l, 

r = [-1,1] x [-1,1], ft = [-1,1], 



In this chapter we shall characterize solutions to this problem by 
using the conditions developed in chapter 3. 

First notice that in order to satisfy hypothesis (2) of theorem 
3.1, we must have |x (t)| < 1 for all t according to theorem 3.2. 
Since f^ is independent of u, the row in M(t) corresponding to x 1 
will always be zero. Thus, if |x (t)| = 1, M(t) would not have 



34 


maximum rank. Moreover by theorem 3.4, the number of constraints in 
effect at any time cannot exceed m, which is 1 in this case, and so 
we can never have |u(t)| = 1 and |x^(t)| = 1 simultaneously. In 
fact, if |x (t) | = 1 on some interval, then on that interval x 1 = 0, 
in which case, by the state equations, x^ = 0 on that interval, a 
contradiction . 

Let us now investigate subarcs interior to the state region, 

that is, where x_(t) < 1. We have H - - A + A x + A u and hence 

2 o 1 2 2 

O • 

according to theorem 3.6, A^ = 0 and A^ = - A ^ . Moreover, by theorem 
3.11, we have A^uCt) >_ A 2 v for all v of absolute value less than or 
equal to one. Therefore, u(t) - 1 provided A^(t) > 0 and u(t) » - 1 
when A^(t) < 0. From the above differential equations we see that 
A 1 is constant and A^ is linear with slope - A^. Notice that A^ 
is not identically zero since in this case A * - A^ would be zero 
also, which would mean by corollary 3.4 that A q = 0. This would 

violate the condition that (A ,A_,A^) 4 = 0. Moreover, interior to 

ol2 

the subarc, we cannot have A^ = 0 even at an isolated point since 
iri this case u would have to be discontinuous at such a point, 
switching from -1 to +1 or vice versa. Hence, on such a subarc, 
u(t) = +1 or u(t) = -1. Since by theorem 3.5 | x (0) | < 1, the first 
subarc of a solution must be of this kind. 

Type 1 (u = 1) 

If we integrate the state equations we obtain 


x 

x 


1 

2 


“ t 2 4- c 2 t 4 c 
t 4 c 



and elimination of t from the above yields 


1 2 
X 1 = 2 X 2 


+ ^ C 1 2 C 2 ^ 2 X 2 2 + C * 


In the state plane this defines a one parameter family of parabolas 
with the x^axis as their axis of symmetry and all opening to the 
right. Moreover, since x^ = u ~ 1, the movement along these 
parabolas is from bottom to top as t increases. Of course, for 
this problem we are only concerned with their portions lying in I # 
By applying the boundary conditions to the above we see that 
c = x^ and c 2 = x 2 °, ? and that it is possible to reach the origin 

1 9 

without switching only if one starts on the parabola x^ = — 
with x^ <0. 

Type 2 (u - - 1) 

Here we obtain 


X 1 ■ - \ t2 + d 2 t + d l 
X 2 ' ‘ + d 2 


and again eliminating t yields 


X 1 = ‘ 1 X 2 2 + (d l + 1 d 2 2) = 


\ x/ + d. 


In the state plane this defines a one parameter family of parabolas 
with the x^-axis as their axis of symmetry and all opening to the 
left. Moreover, since x = u = - 1, the phase point moves from top 
to bottom along one of these parabolas as t increases. Again the 
boundary conditions yield that d^ = x^ 0 and d 2 = x^ 0 , and it is 



36 


possible to reach the origin without switching only if one starts on 

X 2 

the parabola Xj = - — x 2 with x 2 0. 

Let us now consider the possibility of sub arcs on which 

|x 2 (t) | - 1. In this case we must have that x 2 = u « 0, according 

to theorem 3.6, since |u(t)| < 1, A^(t) - 0 along such a subarc, and 
© 

= Q as before. Integrating the state equations we see that either 
x 2 = 1 and x 1 =t+eorx 2 =-l and x^ = - t + f where e and f are 
constants. In the first case we move along the x^-boundary of T from 
left to right as t increases ^ In the second case we move from right 
to left. ’ 

By theorem 3.7, A must be continuous at each corner, and since 

it is constant on each possible type of subarc, we must have that 

A (t) = A^° for all t e [t Q ,t 1 ]. Also, by corollary 3.4, H = 0 

along the optimum trajectory. In particular this means that H is 

continuous across corners. Therefore, the term A^u must be continuous 

across corners because the other two terms in H, namely A + A x , 

0 12 

are continuous everywhere. However, we have seen that at each corner 
u is discontinuous switching between its possible values of +1, -1 
and 0. Therefore A^ must be continuous at such corners with value 0. 

Let us now collect this information obtained from the necessary 
conditions in order to develop an optimum strategy starting at an 
arbitrary point for which | x^ ° | < 1 and |x 2 °| <1. Observe that we 
can have at most three sub arcs. We must, as was shown previously, 
begin on a subarc of type 1 or type 2 on which A^ is linear with 
slope - A ^ . The function A^ must go to 0 as we approach a switching 
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point. If we switch to a subarc of type 1 or type 2 again, then 
can never again be 0 since on this subarc it will be linear with 
the same slope as previously . Thus we can never switch again. If 
the x^ boundary is encountered, that is, if the absolute value of 
becomes 1, then we switch to u = 0 and A^ is 0 along such a 
subarc. In order to get to the origin we must switch from this 
boundary subarc to one to type 1 or type 2 in which case by the 
above argument A^ can never become 0 again. Hence no further 
switching is possible. 

Therefore, to get to the origin from an initial point not 
lying on a parabola which leads to the origin we must do the 
following. Through (x^jX^ 0 ) there passes exactly one member of 
each type 1 and type 2. However, only one of these parabolas leads 
(in the direction of increasing t) to a parabola which leads to the 
origin or to a boundary of T where |x^| =1 which leads to a 
parabola which leads to the origin. So one must travel along that 
parabola with u = -1 or u = +1 (whichever is appropriate) and switch 
the value of u to the negative of its previous value when the parabola 
leading to the origin is encountered. If the boundary of T is met, 
we switch to u = 0 and move along the boundary until arriving at the 
parabola leading to the origin where u is switched again to either 


+1 or -1. 
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To summarize 3 if 


1. -1 < x 2 ^ 0 and 



use u = +1 


use u = -1 


2 . 0 x 2 < 1 and 


\ x 2 2 1 X 1 < 1 


- 1 < X 1 < " \ x 2 2 


use u = -1 


use u = +1 



use u = 0 


use u = -1 


use u = 0 


use u = +1 


o o 1 / 0\ 2 

However, if 0 and x 1 + y (x 2 ) ^_ 1 the subarc of type 

2 which must be taken first leads to the x } = 1 boundary which 
contradicts our first observation that | x 1 C t) | < 1 at all points 
along a solution. Similarly, if x 2 ° £ 0 and x ° - y ( x 2 °) 2 £ "1 
then the subarc of type 1 which must be used leads to the x 1 = -1 
boundary. Hence there can be no solution to the problem for the 
initial point in these regions. 

So we see that from each point in the interior of T, excluding 
these two regions, there emanates a unique trajectory satisfying all 
the necessary conditions of chapter 3 which leads to the origin. 

Only these trajectories can be optimal (solutions to our problem) . 
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Thus, if a solution exists from a given initial point then it must 
necessarily be the above unique trajectory passing through the initial 
point and going to the origin. To determine if these trajectories 
are indeed optimal, they must be examined in the light of sufficiency 
criteria. Unfortunately, very little has been done in the way of 
developing sufficient conditions for solutions to bounded state 
problems. The question of sufficiency for this example with an 
unbounded state region is investigated in (11). 
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5. THE LINEAR TIME OPTIMAL CONTROL PROBLEM 
WITH BOUNDED STATE VARIABLES 

In chapter 4 an example of a linear time optimal control problem 
with bounded state variables was considered. We shall take up the 
general problem in this chapter. That is , consider problem I in 
section 2.1 with 

ft as { u e R m : | Uj | j^l for j = 1 ,2 , « 0 io ,m) , 

r = {x e R n : |x_^| 1 for i = 1,2 , • • • »n} , 

f (t,x,u) = 1 and f(t,x,u) = Ax + Bu 

where A is an n x n matrix and B is an n x m matrix. Here the state 

equations are x = Ax + Bu which are linear in both the state variables 

and the control variables. The cost function in this case is 



o 


the total time duration of a trajectory. Therefore we seek a solution 
to the state equation x(t) and u(t) going from some initial state x° 
to some final state x 1 in minimal time. 

By way of notation, we shall use N (A) to designate the left 

L 

null space of a matrix. That is, N (A) consists of all vectors \ 

L 

such that XA = 0. 

5.1 Subarcs Interior to the State Region 
As usual ^e assume that x(t) and u(t) defined on [t^,^] yield 
a solution to the problem. Consider a sub arc of x(t) from time t ¥ 
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to time t M such that |x (t)| < 1 for all t £ (t f ,t") and for 
i = l,...,n. That is, the subarc is contained in the interior of F. 
Assume that on this subarc there are k components of u(t) such that 
|u_.(t)| = 1. Let u(t) be the subvector of u consisting of all its 
components where |u.(t)| = 1, and u(t) consist of those components 


for which |u^(t) | < 1, Then u will be a k-vector and u an 
(m - k) -vector* Let B consist of those columns of B corresponding 
to components of u that are in u, and let B contain the columns of 
B corresponding to elements of u that are in u* Then B will be an 
n x k matrix and B an n x (m - k) matrix. 

In chapter 2 we observed that the necessary conditions for 
solutions in the interior of T are the same as those for the problem 
with an unrestricted state region* Therefore, all results stated 
here hold also for the same problem with T « R n since all results 
in this section follow from those necessary conditions. One of 
these is the well known bang - bang principle. 

We shall consider A to be a row vector so that multiplication 
of A by a matrix will occur on the right* As before, x and u will 
be thought of as column vectors and will be multiplied on the left 
by matrices. 

Theorem 5*1 * On each such subarc A(t) satisfies the equations 


A + AA = 0 and AB = 0. 


Proof : For this problem H(t,x,u) = A q + A® (Ax + Bu) and by definition 

Bu = BTT + Bu. Applying theorem 3.6, we obtain that along each subarc 
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of a solution A + H =0 and H =0 for all j such that |u.| < 1 

* "j J 

or, in other words. « 0, Now H = AA and = AB. Therefore, 

A 4- AA = 0 and AB = 0 . 

Theorem 5.2 . For all t e (t ? ,t M ), A(t) 4 0 and 

nzl 

A (t) e f\ N t (A 1 !*) . 
i=0 L 

Proof: From theorem 5.1 we have that A(t) satisfies A = - AA and 

• ~ » 

AB = 0. Differentiating we obtain AB = 0 and substituting for A 

yields that AB = - AAB - 0 along the subarc. Continuing this process 

o ~ 9 i ~ 

we obtain -AAB = AA B = 0 along the subarc, and in general AA B = 0 
for i = 0,1,..., n - 1. Therefore, 

n^l 

A (t) e N (A X B). 
i=0 

If A(t) =0 for some t, then since by corollary 3.4 H = 0 along a 
solution, we would have at that time 0 = H = A + A* (Ax + Bu) = A . 

So (A q , A( t)) = 0 which contradicts the result of theorem 3.6 that 
(A^^ACt)) 4 0 for all t. Therefore, A(t) 4 0 for all t e (t f ,t n ). 
Corollary 5.1 . The rank of the n x n(m - k) matrix [B,AB,...,A n "*"B] 
is less than n. 

N (A 1 ^) on the subarc. That is, 

JL 

i_ ^ ^ j ^ 

AA B = 0 for i = 0,1,. ..,n - 1 or equivalently A[B,AB,...,A B] = 0. 
But since A 4 0 this means that the rank of the above matrix must be 
less than n. 

By using this corollary we can determine which combinations of 


Proof: By theorem 5.2, A e 

i=0 
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components of u can possibly have absolute value one on a subarc. 

If 9 for a given choice of B, the rank of the above matrix is n, then 
we can never have a solution to the problem with such a u and u on 
any subarc. 

Definition 5.1 . The matrices A and B satisfy the general position 
condition of order where 1 ^ q _< m provided whenever Q is an n x q 
matrix consisting of q columns of the matrix B, then the n x nq matrix 
[Q ,AQ,A 2 Q , . . . ,A n 1 q] has rank n. If A and B satisfy the general 
position condition of order 1, then we simply say that they satisfy 
the general position condition . 

Theorem 5.3 . A and B satisfy the general position condition of order 
q if and only if for all matrices Q consisting of q columns of B, we 
have 

n-1 

O N (A 1 Q) = {0} 

W L 

n-1 

Proof: Suppose a e N (A 1 Q) for some Q. Then aA 1 Q = 0 for 

i=0 L 

i = 0,...,n - 1, and hence 

a[Q,AQ A n-1 Q ] = 0 

which implies that a = 0 by the general position condition of order q. 
n^l i 

Therefore f | N (A Q) = {0}. If the general position condition of 
i=0 L 

order q is not satisfied, then there exists a 4 0 such that for some Q, 
a[Q , AQ , . . . , A n ^Q] = 0 which implies that aA^Q = 0 for i = 0 , . . . ,n - 1. 



a contradiction,, Hence 


n--l 

But this means that a e f\ N (A Q) = {0}, 

1=0 L 

the general position condition of order q is satisfied. 

Theorem 5.4 . If A and B satisfy the general position condition of 
order q, then A and B satisfy the general position condition of 
order q ? for all q* such that q <_ q ' <_ m. 

Proof : Let Q T be an n x q 1 matrix consisting of q T columns of B. 

Then since q _< q T , let Q be the n x q matrix consisting of the first 

U"" 1 

q columns of Q r , By the hypothesis the matrix [Q,AQ,...,A Q] has 

^ ^ 

rank n„ Therefore, [Q 1 ,AQ 1 , . . . , A Q T ] has rank n also since the 
previous matrix consists of a subset of the columns of this matrix. 
Hence A and B satisfy the general position condition of order q ! . 
Theorem 5*5 . Suppose A and B satisfy the general position condition 
of order q. Then along any subarc of a solution interior to F, at 
least m - q + 1 controls have absolute value one. That is, 
k >_ m - q + 1 . 

Proof : We shall argue by contradiction. Suppose k <_m - q, then 

q <_ m - k, in which case by theorem 5.4, we have that A and B satisfy 
the general position condition of order m - k. But B consists of 
exactly m - k columns of B, which means by corollary 5.1 that 

rank ([B,AB A n_1 B] ) < n, 

a contradiction to the fact that A and B satisfy the general position 
condition of order m - k. Therefore, k > m - q + 1. 
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Corollary 5.2 . (bang - bang principle) If A and B satisfy the general 
position condition, then along any subarc of a solution interior to 
r we have that |u_.| = 1 for j = l,...,m. 

Proof : By theorem 5.5, using q = 1, we obtain k^m-l+l = m, 

and since k can be at most m, we thus have k = m. Therefore, all 
components of the control vector have absolute value one. 

Corollary 5.3 . If A and B satisfy the general position condition of 
order q where 1 q m, then the controls along any sub arc interior 
to r always lie on the boundary of ft. 

Proof: By theorem 5.5, k^_m - q + l^_m-m+l = 1. Therefore at 

least one component of u has absolute value one. Hence u lies on 
the boundary of ft. 

(Note: if q = m, then the hypothesis of corollary 5.3 is the so-called 
controllability condition.) 

Corollary 5.4 . If there is a subarc of the optimal trajectory on 
wtiich the control in interior to ft, then A and B do not satisfy the 
general position condition of any order. 

We shall now develop a method for determining which components 
of u must have absolute value one on every subarc of x(t) which is 
interior to T . 

Designate the columns of B by bj,b 2 ,...,b and define 

M. = [b ,Ab ,A 2 b . , . . . ,A n ^b . ] 

J j j J J 


for j 


1 , . . . , m . 


Clearly each M 


j 


is an n x n matrix. 



Notice that: 


(a) A and B satisfy the general position condition if and only if 

each M. has rank n. 

1 

(b) A and B satisfy the general position condition of order q if 

and only if each n x nq matrix consisting of precisely q of 

the M. T s has rank n. This follows because if Q = [b . , . • • ,b . ] 

J Jl J q 

then [Q,AQ,...,A Q] is just [M. ] with its columns 

2 1 J q 

rearranged. Therefore both matrices have the same rank. 

Theorem 5.6 . Suppose M has rank n for some j where 1 £ j m, then 
| Uj | = 1 on any optimal subarc which is interior to T. 

Proof ; Suppose |u | < 1 on some subarc interior to T. Then on that 
subarc from theorem 5.1 we have 


n-1 

A e N l (A X B) 


i=0 


where B contains column b. since u. < 1. Since M. has rank n, it 

J J J 

~ *“* n~ 1 ~ 

follows that [B,AB,...,A B] has rank n because consists of a 
subset of the columns of this matrix. This was shown in (b) above. 
Irl 


Therefore 


n 

i=0 


N (A X B) = {0.:, which means that X = 0 on this subarc, 


a contradiction. Thus we must have that | u^ | =1. 

Corollary 5,5 . If |u.| < 1 on some subarc interior to F, then M. 

J J 

has rank less than n. 
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Theorem 5.7. On each subarc of x(t) interior to T, 




-1 provided Xb^< 0 


+1 provided Ab^ > 0 


for j = 1,, 


* Til# 


Proof: According to theorem 5.1, Ab_. = 0 for all j such that is 

in u. Thus, we need only investigate the u_. that are in u, that is, 

the ones that have absolute value one. According to corollary 3.5, 

each component of H_*TT must be non-negative. But * AB; therefore 

this condition requires that Ab_.u^. 0 for each u^ in u. , Hence 

when Ab . > 0, u. = +1, and when Ab . < 0, u, 3 -1. 
j J J J 


5.2 Sub arcs Along the Boundary of the State Region 
We shall now consider a sub arc of x(t) from time t’ to time t" 
such that |x_^(t) | - 1 for some i such that 1 < i < n and for all 
t e (t f ,t M ). That is, the subarc is contained in the boundary of F. 
Assume that along this subarc there are H components of x such that 
x, = 1. Let x be the Subvector of x consisting of all x. such 

l l 

that x. = 1, and x contain all x. such that x. < 1 on the subarc. 

1 i 1 ’ l 1 l 

Then x is an Jl-vector and x an (n - £) -vector. Define u and u as 
in section 5.1. 

Now let A consist of all columns of A corresponding to components 
of x in x, and A contain all columns of A corresponding to components 
of x in x. Subdivide A into A 1 and A 3 , A 1 consisting of all of its 
rows corresponding to the x_^ in x and A^ containing all rows 
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corresponding to the x^ in x. Similarly, subdivide A into A 2 and A^. 

Then: A is an n x £ matrix; 

A is an n x (n - Z) matrix; 

Aj is an J l * l matrix; 

A 2 is an Z x (n - Z) matrix; 

A 3 is an (n - Z) x Z matrix; 

A 2 is 4 n (n - Z) x (n - Z) matrix. 


Define B and B as in section 5.1, and subdivide B into Bj and 

Bg. Let B consist of all rows of B corresponding to components of 

_ 1 > 

x in x, and Bg contain all rows corresponding to the x^ in x. 
Subdivide B into B and B 4 similarly. 

Then: B is an n * k matrix; 

B is an n x (m - k) matrix; 

Bj is an Z x k matrix; 

? 2 is an Z x (m - k) matrix; 

Bg is an (n - Z) x* k matrix; 

B 4 is an (n - Z) x (m - k) matrix. 


Theorem 5.8 . On any subarc along the boundary of T, B^ has rank Z. 
Proof : By theorem 3.2, along this subarc M(t) must have rank Z. 


Recall that M(t) consists of the rows of f diag (cos z) corresponding 
to x^ for which |x^| = 1. Therefore M(t) = f^ diag (cos z) . For 
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u_. in u, cos z_. 4 0, so by dividing each column of diag (cos z) by 
the appropriate cos z ^ 4 0 , we obtain 

rank (M(t)) = rank (f~ diag (cos z)) = rank (f-J . 

Now f = Ax + Bu = Ax + BU + Bu, so f~ - B. Eliminating all rows 
corresponding to x, in x, we obtain f~ = B^. Therefore, 

& = rank (M(t)) = rank (B^) . 

Theorem 5,9 . On each subarc lying on the boundary of r, x(t) ,u(t) and 
A(t) satisfy the equations 

Tb 2 + xb 4 - 0 
A + XA 0 + AA = 0 , 

2 4 * 

A-^x + A^x + B^ + B^u = 0, 

A 3 x 4- A 4 x + B 3 u + B 4 u = x, 

A + A(A x + A, x + B u + B. u) = 0. 
o 3 4 3 4 

The last equation holds if u is differentiable on the subarc. 

Proof: For this problem 

H = A + A • (Ax + Bu) 

o 

= A q + A • (AX + Ax + BTT + Bu) 

= A^ + AA^x + AA 3 x + AA^x + AA 4 x 

+ AB^u + AB^u ^ AB^u ^ ^4^* 

Hence, applying theorem 3.6, 

H. - Ib o + AB, =0 and 

u 2 4 

* ,• ^ 

A + H,. = A + AA + AA =0. 
x 2 4 
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So, the first two equations are proved. For each x^ in x, x_^ = 0 
since x^ is constant along the subarc. Thus, f = 0 along the 
subarc. But 


f = Ax + Bu = Ax+Ax+Bu + Bu, 

and hence f = + A^x + B^u + B^u « 0. For the other components 

x = f = Ax H-Ax + Bu+Bu are the state equations . If u is 
3 4 3 4 

differentiable on the subarc, then we can apply corollary 3.4 to 
obtain H = 0 along the sub arc. Now by subtracting from the expansion 
of H above the terms in the third equation, we obtain 

H = A + AA x + AA x + AB u + AB. u 
0 3 4 3 4 

= A 4- A(A x + A ( x+Bu+Bu) =0. 

0 3 4 3 4 

Theorem 5.10 . Let A f = A(t'), then for all t e (t 1 , t") ,A(t) ^ 0 and 

X(t) - 5* e - c ">, 

“(t) = - A(t)B 4 B^K, 

. A(t) s N l (B 4 - B 4 B^KB 2 ), 

where K = (B^B^) ^ . 

T 

Proof ; Since B^ has rank l t B^B^ is an invertible £ x z matrix. 

Thus, K is well defined. By theorem 5.8, AB 2 = -AB 4 and so 

AB 2 B 2 T = - AB 4 B^ which implies that A = - AB 4 B 2 K. Also, by theorem 

5.8, A * - AA^ - AA^. Therefore, substituting for A, we obtain 
~ t 

A « A(B 4 B 2 KA 2 - A ) which has the unique solution 
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Mt) 


e ( Vl“ 2 * V <t 


t ? ) 


on the subarc. Also, if A(t) = 0 for some t e (t ? ,t")> then by the 
second equation of the theorem, A(t) = 0, Moreover, by viewing the 

fifth equation of theorem 5.8, we see that A =0. Hence (A ,A) = 0 

o o 

which violates the conclusion of theorem 3.6 that (A^, A) =f 0 for all 

t e [t Q ,t ]. Thus A (t) =j* 0 for all t e (t f ,t M )o Finally if we take 

A = - AB, B^K and substitute it into AB rt + AB, = 0, we obtain 
4 2 2 4 9 

A(B 4 - B 4 B2KB 2 ) = 0. 

T 

It thus follows that A e N^(B 4 - B^B^KB^) . 

Corollary 5.6 . If k + £ = m, then A(t) e N^CB^ - B^B^KB^) is always 
satisfied. 

Proof: B^ is an £ x (m - k) matrix. Therefore, if £ - m - k, B 2 

is a square £ x £ matrix, and since it has rank £, it is invertible 

and so is B^. Hence, K = (B^bJ) ^ “ (bJ) ^ B 2 Thus 

T T T -1 -1 

B 2 KB 2 ■ B 2 (B 2 ) B 2 B^ = I, the £ x £ identity matrix. It therefore 
follows that 

B , - B B T KB = B - B =0, 

4 4 2 2 4 4 

the (n - £) x £ zero matrix, and so 

A (t) e N t ( 0) = R n_£ 

L 


is automatically satisfied for all t e (t , ,t n ). 
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Notice that in case k + i < m, the above condition is not 

T 

necessarily trivial. In this case £ < m - k which implies that 

T 

has rank less than m - k since has rank £. Hence cannot 

be the (m - k) x (m - k) identity matrix, 

T 

Corollary 5,7 , The rank of B^ - B 4 B 2 KB 2 is less than n - £. 

Proof: This follows from the two results of theorem 5.9 that A(t) 4 0 

and A(t) e N L (B 1+ - B^B^KB^ . 

T 

Theorem 5,11 . Let C - AB^B 2 K, an n * £ matrix, and form the n * n 
matrix D in the following way. For each i such that 1 ^ i if 

|x_J =1 and x_^ is the j-th component of x, then let the i-th column 
of D be the j-th column of C. If |x^| < 1 and x^ is the j-th 
component of x, then let the i-th column of D be the j-th column of 
-A. Then along this subarc, A ( t ) satisfies 

A = AD and AB = 0. 

Proof : By definition AB 2 4- AB^ « AB, so the second equation is 

satisfied by theorem 5,9. Moreover, if we differentiate AB 2 + AB^ = 0, 
we obtain 

_«_ ^ ^ ^ 

AB = - AB = ( AA + AA )B since A = - AA - AA . 

2 4 2 4 4 2 4 

Therefore, A = (AA^ + AA^)B^bJk which is equal to AAB^B^K by the 

• • 

definition of A 2 and A^ . So we have A = AC and A = A(-A), and putting 
these together according to the definition of D in the hypothesis, it 

i 

follows that A = AD. *. 
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We have now put the equations for X on a sub arc along the 
boundary of the state region into the form of theorem 5.1, which 
gives the equation for X on a subarc interior to the state region. 

Thus, the same type of logic used in theorem 5.2 may now be applied 
to obtain similar results in this case. 

n-i . 

Theorem 5.12. For all t e (t',t"), X(t) e N T (D B) . 

W L 

Proof : By theorem 5.10, X(t) satisfies the equations X = XD and XB = 0 

• ~ 

on this subarc. Thus, by differentiating we obtain XB = 0 and since 
X = XD, this implies that XDB = 0. Continuing this process, we 
obtain XDB = XD 2 B = 0 along the subarc, and in general XD^B = 0 
for i = 0,1,. ,n-l. Therefore, 


X(t) e 



Corollary 5.8 . The rank of the n x n(m - k) matrix [B,DB,...,D J 'B] 
is less than n. 

Proofs This follows from the facts that X(t) e N (D^B) , and 

i=0 L 

X(t) 4 0. 


Notice that there is a one-to-one correspondence between 
possible boundary configurations and decompositions of A and B into 
A^A^A^A^B^ and B^. By possible boundary configurations 

we mean the possible combinations of each component of u and x 
either having absolute value equal to one or less than one. The 
theory of this section was developed with the thought in mind of 
enabling one to narrow the list of possible boundary configurations 


somewhat. 



First one applies the condition that k + l ^ m to eliminate all 
cases where the total number of constraints in effect exceeds the 
number of control variables * One might observe that this is the 
same as saying that the total number of constraints not in effect 
must be greater than or equal to the number of state variables. 

This is true because if k + l <_ m then -k - l >_ -m, and if we add 
n + m to both sides of this inequality we obtain (m - k) + (n - £) >_ 
Secondly, one can apply corollary 5,7 to rule out some or all 
of the combinations where k + & < m. This is a consistency 
condition which must be satisfied if there is to be a non-trivial 
solution to the equation for A in theorem 5,8. 

Finally, one can reduce the possibilities further by applying 
corollary 5,7, 

Theorem 5,13 . For all t e [t ? ,t M ] and for all v £ ft such that 
A^xCt) + A^x(t) + B-^v + B^v = 0 , we have 

A (t) (B gU(t) + B 4 u(t)) >_ A(t) (BgV + B 4 v) . 

Proof % From the proof of theorem 5.9, we have that if 


A_x(t) + A x(t) + B v + B n v = 0, 
1 2 12 


then 


H(x(t) ,v,A(t)) = + A(t)(A 3 x(t) + A^x(t) + B 3 v + B 4 v) . 


Thus, applying theorem 3,11, it follows that 
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A q + A(t)(A 3 x(t) + A 4 x(t) + B 3 u(t) + B 4 u(t)) 


+ A(t)(A 3 2 T(t) + A 4 x(t) + B 3 v + B 4 v) , 


and therefore, by subtracting out the common terms on each side of 
the inequality, we obtain 


A(t)(B 3 u(t) + B^u(t) ) >_ A (t) (B 3 v + B 4 v) 


theorem 5,14 , On each sub arc of x(t) along the boundary of F, 

/ 




-I provided Ab_. < 0 


‘+1 provided Ab_. > 0 


for j * l s .. M m where b^ is in the j-th column of the matrix B. 
Proof ; According to theorem 5.11, Ab^ = 0 for all j such that u^ 
is in u. Thus , we need only investigate the that are in u, that 
is s the ones that have absolute value one. According to corollary 


3.5, each component of H— *u must be non-negative. But frj = AB and 

therefore this condition requires that Ab.u. > 0 for each u. in u. 

j j “ J 

Hence, when Ab , > 0 then u. = +1 and when Ab < 0, u. = -1. 

2 2 2 2 

So we see that the results of theorem 5.7 hold even in the case 


where x(t) lies on the boundary of F. 
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6. SUMMARY AND CONCLUSIONS 

In this thesis a method has been described whereby an optimal 
control problem with bounded state variables may be transformed into 
an equivalent Lagrange problem. This was accomplished by means of 
differentiable mappings which take some Euclidean space onto the 
closecT and bounded control and state regions. Whereas all such 
mappirigs lead to a Lagrange problem, it has been shown that only 
those which were defined as acceptable pairs of transformations are 
suitable in the sense that solutions to the Lagrange problem lead 
to solutions to the bounded state problem and vice versa. In 
particular, an acceptable pair of transformations was exhibited for 
the case when the control and state regions are right parallelepipeds. 

The necessary conditions of the Calculus of Variations were 
then applied to the transform of the non-linear, non-autonomous 
optimal control problem with the unit n-cube and unit m-cube as 
state and control regions respectively. These conditions are the 
Euler Lagrange equations, corner conditions, transversality conditions, 
tlje Clebsch condition and the Weiers trass condition. In each case 
aitalogous conditions were developed which solutions of the bounded 
state problem must sdtisfy. In addition, the hypothesis for applying 
these conditions led to theorems which state that the initial and 
terminal points must be interior to the state region, and that a 
certain submatrix of the matrix of partials of the constraining 



57 


differential equations with respect to the control variables must 
have maximum row rank. 

These results were than applied to a simple example of a 
linear time optimal control problem with bounded state variables 
for which solutions were completely characterized. The general 
linear time optimal problem was then considered and various 
necessary conditions developed for it in terms of the coefficient 
matrices • 

First, the properties of sub arcs of a solution interior to the 
state region were developed. It was observed that these must also 
hold for the non-state constrained problem. Among them is the well 
known bang - bang principle. Then subarcs which lie along the 
boundary of the state region were investigated with several new 
necessary conditions developed. 

There are a number of areas in which the author feels that the 
research of this thesis may be extended to yield further results. 

If acceptable pairs of transformations can be exhibited for more 
general state and control regions, perhaps convex polyhedra or even 
compact convex sets, then the same analysis can be applied to obtain 
necessary conditions. Of particular interest would be state regions 
described by a system of inequality constraints of the form g(x) <_ 0. 

The question of sufficiency has been totally ignored in this 
thesis. Although there are a number of very difficult problems 
associated with the development of sufficient conditions for the 
bounded state problem, the author feels that some results may be 
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obtained by using this transformation method to apply the sufficient 
conditions of the Calculus of Variations to the bounded state 
problem. The question of existence of solutions may possibly be 
studied by applying the theory of generalized curves to the trans- 
formed problem. 

It was shown in chapter 3 that state boundary subarcs are 
singular subarcs of the transformed hagrange problem. Therefore, 
an investigation of the known properties of singular subarcs may 
lead to new necessary conditions for the bounded state problem. 

It was observed that a number of the results of chapter 3 are 
similar to those obtained by other authors. Further studies should 
be made to determine the exact relationship between the results 
contained herein and those of others, notably Berkovitz and 
Gamkrelidze. 

In this formulation, sectionally continuous controls and 
sectionally smooth trajectories were considered. The same theory 
could have been developed using bounded measurable controls and 
absolutely continuous trajectories. Also, other types of problems 
could be investigated. Perhaps the best place to start would be to 
consider the linear problem with different cost functions, for 
example, a quadratic cost function. 

Finally, some experimentation should be made, possibly on a 
computer, in the utilization of the conditions developed to obtain 
solutions to particular problems. 
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